IN THE CLAIMS 

This listing of tlie claim will replace all prior versions and listings of claim in 
the present application. 
Listing of Claims 

1 . (currently amended)A similar document search method of searching for 
a s i m i lar document similar to a sp e c i f ie d seeds documen t from a document 
database, the method , comprising the steps of : 

a first extracting step of extracting at least one characteristic word cand i dat e 
as a candidate for a charact e rist i c word f rom a -the seeds document including desired 
retrieval contents; 

a second extracting step of extracting as characteristic words of the seeds 
document. wheR-jlthe characteristic word candidat e extracted by said -the f irst 
extracting step is a compound characteristic word i nc l ud i ng constructed by a plurality 
of constituent characteristic words, the compound characteristic word and the 
constituent characteristic words included in the compound characteristic word4Fem 
the charact e r is tic word cand i d a t e; 

a step of calculating, according to the characteristic words extracted by esM 
the second extracting step, similarity between the seeds document and a r e gistrat i on 
docum e nt the document stored on the document database, by using the 
characteristic words including the compound characteristic word and the constituent 
characteristic words by which the compound characteristic word is constructed : and 

a step of outputting a retrieval result as a result of the similarity calculated by 
sai€t -the similarity calculating step. 
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2. (currently amended)A similar document search method according to 
claim 1, wherein said -the second extracting step includes a step of determining, 
when constituent characteristic word information indicating that constituent 
characteristic words ef-is_the characteristic word is registered to the characteristic 
word corr e sponding to th e character i stic word cand i dat e e xtract e d by sai d f i r s t 
oxtract i ng st e p , that the characteristic word candidat e is a compound characteristic 
word. 

3. (currently amended)A similar document search method according to 
claim 1 , wherein said -the similaritv calculating step includes the steps of : 

a step of calculating a weighting coefficient corresponding to a distance 
calculated by term appearance position on the seeds document between a 
constituent characteristic word and another constituent characteristic word which are 
extracted from one compound characteristic word; and 

a step of calculating similarity by multiplying the weighting coefficient. 

4. (currently amended)A similar document search system for searching 
foF-a sim i lar d ocument similar to a sp e c i fi e d seeds documen t from a document 
database, the system , comprising: 

a document analyzer processor for extracting at least one characteristic word 
cand i dat e a s a cand i date for a characteristic word f rom athe seeds document 
including desired retrieval contents; 

a characteristic word extractor processor for extracting as characteristic words 
of the seeds document, whee-iLthe characteristic word cand i dat e extracted by saM 
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the document analyzer processor is a compound characteristic word i nclud i ng 
constructed by a plurality of constituent characteristic words, the compound 
characteristic word and the constituent characteristic words included in the 
compound characteristic word from th e charact e rist i c word cand i dat e; 

a seeds document similarity calculator processor for calculating, according to 
the characteristic words extracted by said -the characteristic word extractor 
processor, similarity between the seeds document and a registration docum e n t the 
document stored on the document database, by using the characteristic words 
including the compound characteristic word and the constituent characteristic words 
by which the compound characteristic word is constructed : and 

a retrieval result output processor for outputting a retrieval result as a result of 
the similarity calculated by said -the seeds document similarity calculator processor. 

5. (currently amended)A similar document search system according to 
claim 4, wherein said -the characteristic word extractor processor includes a 
compound characteristic word determiner processor for determining, when 
constituent characteristic word information indicating that constituent characteristic 
words efis the characteristic word is registered to the characteristic word 
corr e sponding to th e charact e ristic word candidat e e xtract e d by sa i d docum e nt 
analyzer proc es sor , that the characteristic word cand i date is a compound 
characteristic word. 

6. (currently amended)A similar document search system according to 
claim 4, wherein said -the seeds document similarity calculator processor includes: 
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a weighting coefficient calculator processor for calculating a weighting 
coefficient corresponding to a distance calculated by term appearance position on 
the seeds document between a constituent characteristic word and another 
constituent characteristic word which are extracted from one compound 
characteristic word; and 

a calculator processor for calculating similarity by multiplying the weighting 
coefficient. 

7. (currently amended)A program product for making a computer operate 
as a similar document search system for searching fof-a sim il ar document similar to 
a spec i f ie d seeds documen t from a document database , the program comprising: 

a document analyzer processor program for extracting at least one 
characteristic word cand i dat e a s a cand i dat e for a charact e rist i c word f rom a-the 
seeds document including desired retrieval contents; 

a characteristic word extractor processor program for extracting as 
characteristic words of the seeds document, when-if_the characteristic word 
cand i d a t e extracted by said -the document analyzer processor program is a 
compound characteristic word i nc l uding constructed by a plurality of constituent 
characteristic words, the compound characteristic word and the constituent 
characteristic words included in the compound characteristic word from th e 
charact e r is tic word candidat e; 

a seeds document similarity calculator processor program for calculating, 
according to the characteristic words extracted by said -the characteristic word 
extractor processor program, similarity between the seeds document and-a 



rogictrat i on docum e n t the document stored on the document database, by using the 
characteristic words including the compound characteristic word and the constituent 
characteristic words by which the compound characteristic word is constructed : and 

a retrieval result output processor program for outputting a retrieval result as a 
result of the similarity calculated by sald -the seeds document similarity calculator 
processor program. 

8. (currently amended)A program product for making a computer operate 
as a similar document search system according to claim 7, wherein sald-the 
characteristic word extractor processor program includes a compound characteristic 
word determiner processor program for determining, when constituent characteristic 
word information indicating that constituent characteristic words ef-is_the 
characteristic word is registered to the characteristic word corr e spond i ng to th e 
charact e ristic word candidate extract e d by said docum e nt analyz e r proc e s s or 
program , that the characteristic word candidate is a compound characteristic word. 

9. (currently amended)A program product for making a computer operate 
as a similar document search system according to claim 7, wherein sai4 -the seeds 
document similarity calculator processor program includes: 

a weighting coefficient calculator processor program for calculating a 
weighting coefficient corresponding to a distance calculated by term appearance 
position on the seeds document between a constituent characteristic word and 
another constituent characteristic word which are extracted from one compound 
characteristic word; and 



a calculator processor program for calculating similarity by multiplying the 
weighting coefficient. 
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